Implementation and Performance Analysis of the φ-Failure Detector
نویسندگان
چکیده
Failure detection is a fundamental building block for ensuring fault tolerance in distributed systems. However, providing accurate and flexible failure detection in off-the-shelf distributed systems is difficult. Practical solutions to failure detection rely on some adaptive mechanism to cope with the unpredictability of networking conditions. However, while they provide reasonably good accuracy, they also lack the necessary flexibility to provide failure detection as a system-wide service. In particular, traditional solutions take a “one size fits all” approach, which prevents them from simultaneously supporting several distributed applications with very diverse QoS requirements. In this paper, we present a novel approach to adaptive failure detection, called φ-failure detector, which addresses the flexibility issue mentioned above. We describe an implementation, and analyze its behavior over intercontinental communication links during several weeks. Our experimental results show that our failure detector compares well with other known adaptive failure detection mechanisms, with the considerable advantage that it provides virtually limitless flexibility.
منابع مشابه
A performance study of the conceptual implementation of the GEM-tracking detector in Monte Carlo simulation
PANDA experiment (antiProton ANnihilation at DArmstadt) is one of the key projects of the future FAIR facilities to investigate the reactions of antiprotons with protons and nuclear targets. experiment is designed to serve as a completely extraordinary physical potential due to exploiting the availability of cold and high-intensity beams of antiprotons. One of the significant parts of the ...
متن کاملThe Φ Accrual Failure Detector
Detecting failures is a fundamental issue for fault-tolerance in distributed systems. Recently, many people have come to realize that failure detection ought to be provided as some form of generic service, similar to IP address lookup or time synchronization. However, this has not been successful so far. One of the reasons is the difficulty to satisfy several application requirements simultaneo...
متن کاملPerformance Analysis of Hierarchical Failure Detector
We present a new failure detector implementation. This implementation, a variant of the heartbeat failure detector, is both adaptable and designed for scalability. Its first specificity of our implementation lies in the fact that it is designed as a shared service among several applications by way of an adaptation layer between the failure detector and the application. This layer adapts the qua...
متن کاملCovariance Analysis of a vector tracking GPS receiver based on MMSE multiuser Detection
In high dynamic conditions, using vector tracking loops instead of scalar tracking loops in GPS receivers is proved as an efficient method to compensate the performance. The Minimum Mean Squared Error detector as a multiuser detector is applied in the vector tracking loop for more reliability and efficiency. The Kalman filter does the two tasks of tracking and extracting the navigation data aft...
متن کاملTwo-ways Adaptive Failure Detection with the φ-Failure Detector
It is widely recognized that distributed systems would greatly benefit from the availability of a generic failure detection service. Such a service can however prove useful only if it can adapt simultaneously to changing network conditions and conflicting application requirements. This paper presents a novel approach to adaptive failure detectors, called φ-failure detectors, which dynamically a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003